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ABSTRACT 

In this paper we study the possibility of detecting lensing signals in high-resolution and 
high-sensitivity CMB experiments. At scales below 1 arcmin, the CMB background 
is dominated by the Sunyaev-Zel'dovich effect in clusters and by Ostriker-Vishniac 
effect distortions elsewhere. Assuming the Sunyaev-Zel'dovich component in clusters 
can be removed, we focus on the Ostriker-Vishniac effect and study the possibility 
of its detection while paying special attention to contaminants, such as instrumental 
noise and point sources. After designing an optimal filter for this particular lensing 
signal we explore the signal-to-noise ratio for different scenarios varying the resolution 
of the experiment, its sensitivity, and the level of contamination due to point sources. 
Our results show that the next generation of experiments should be able to do new 
and exciting science through the lensing effect of the Ostriker-Vishniac background. 

Key words: galaxies:clusters:general; methods:data analysis; dark matter 



• 1 1 INTRODUCTION 



Future high-resolution CMB or microwave experiments 
will focus on secondary anisotropies such as the Sunyaev- 
Zel'dovich effect (SZ hereafter, Sunyaev & Zel'dovich 1972), 
the Ostriker-Vishniac effect (OV hereafter, Ostriker & Vish- 
niac 1987) effect or the lensing effect. Both the SZ effect 
and the OV effect occur when free electrons interact with 
CMB photons. During this interaction, photons gain en- 
ergy from the free electrons (this mechanism is known as 
inverse Compton scattering). This gain of energy creates 
a distortion in the energy spectrum of the CMB which is 
proportional to the temperature of the ionized plasma and 
its density (thermal SZ effect). Also, if the plasma cloud is 
moving with a bulk velocity v, this velocity creates an ad- 
ditional Doppler shift in the energy of the CMB photons 
(kinetic SZ). Historically the thermal and kinetic SZ effects 
have been studied in galaxy clusters. The OV effect is sim- 
ilar to the kinetic SZ effect but focusing on the first free 
electrons after re-ionization at high redshift. The Universe 
is believed to be fully ionized at redshifts z < 10 (e.g Loeb 
& Barkana 2001, Kohler, Gnedin & Hamilton 2007). At this 
high redshifts the temperature of the ionized medium is too 
low so the equivalent of the thermal OV can be neglected. 
High-resolution, high-sensitivity experiments should be able 
to see all three effects. The thermal SZ effect is the domi- 
nant one followed by the kinetic SZ. The thermal SZ effect 
can be easily identified due to its frequency dependence. It 
should be, in in principle, relatively easy to subtract this 
signal from the maps (see for instance Herranz et al. 2002). 



The kinetic SZ can be also removed (at least a big part of 
it) using the spatial distribution of the thermal SZ and the 
fact that both effects are proportional to the optical depth 
of the cluster. On the other hand, the OV effect spreads over 
the entire field of view while the SZ effect is concentrated 
only in galaxy clusters. If one observes a small portion of the 
sky (few arcmin 2 ) it is very possible that none or very few 
clusters will be present in the image and the OV effect may 
be the dominant anisotropy even before subtraction of the 
SZ effect. Having a high redshift uniform background like 
the OV effect is particularly interesting for lensing studies. 
If a massive object (for instance a cluster) is in the field of 
view, it will distort the OV effect creating an elongation on 
the anisotropies in a direction perpendicular to the line con- 
necting the cluster and the anisotropy (in the strong lensing 
regime this elongation can be parallel to this direction, i.e. 
radial arcs). Lensing effect of the OV background opens a 
new door to study the distribution of high redshift large scale 
structures. In particular, it may be a powerful tool to study 
the evolution of galaxy clusters detecting proto-clusters at 
high redshift through their lensing effects. Future experi- 
ments should look for this signal as it offers a unique way to 
study the high redshift universe. In the mm band, there are 
several experiments currently under development, ALMA 
(Wootten 2002), ACT (Kosowsky 2003), SPT (Ruhl et al. 
2004). Although is very possible that these experiments do 
not reach the required sensitivity they will be important to 
address some of the issues discussed below, such as contam- 
ination by point sources. Also, high sensitivity, high resolu- 
tion radio experiments will complement microwave observa- 
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tions by looking at the "holes" in the 21-cm line maps (Wang 
et al. 2006), LOFAR (Rottgering 2003), SKA (Carilli 2004). 
A technique similar to the one presented in this paper could 
be applied but using the 21-cm line as a background at even 
higher redshifts (Koopmans et al. 2004, Zahn & Zaldarriaga 
2006). While effects like the Sunyaev-Zel'dovich effect have 
attracted much of the attention, little has been done regard- 
ing the OV effect (Jaffe & Kamionkowski 1998, Scannapieco 
2000, McQuin et al. 2005, Iliev et al. 2006) in part due to 
the difficulties found when simulating the OV effect. One of 
the main sources of uncertainty comes from the poor con- 
straints on the re-ionization history (usually parametrized 
by the fraction of ionized material as a function of redshift 
X e (z)). Two kinds of constraints have been imposed so far. 
On one hand, studies of the Ly-a forest have found that 
the Universe is completely reionized from redshift z = to 
z ?s 6. Observations of the Gunn-Peterson trough show that 
the end of the reionization era is at z ~ 6 (Fan et al. 2002, 
Kashikawa et al. 2006). On the other hand, WMAP polar- 
ization measurements of the cosmic microwave background 
imply that reionization started at z — lO.QjJ's (Page et 
al. 2006). This constrain on the beginning of reionization 
was derived assuming that reionization was instantaneous. 
If reionization was a gradual process, then the starting point 
could be as high as z w 20. The higher the starting point 
of reionization the denser the Universe was at that point 
and the strongest the OV distortions. Moreover, it is still 
uncertain how the different scenarios of patchy reionization 
affect the final distribution of OV anisotropies. The resulting 
uncertainty in the level of OV anisotropies (angular power 
spectrum) is about one order of magnitude (e.g Santos et al. 
2003). 

Regarding lensing observations, most of them have fo- 
cused on optical observations. Particularly interesting are 
strong lensing studies of galaxy clusters since in this case 
it is possible to have multiple arcs systems coming from 
sources at different redshifts (Broadhurst et al. 1995, 2005a, 
2005b, Shapiro et al. 2000, Gladdeers et al. 2002, Kneib et 
al. 2004). Through strong lensing it is possible to estimate 
the projected mass of a cluster from ten or few tens of kpc 
to about one hundred kpc. Weak lensing studies of galaxy 
clusters on the other hand allow to extend this range to the 
Mpc scale. Some work has been done to study the lensing 
distortions on CMB data by galaxy clusters. Most of these 
works focus on distortions of the primary CMB anisotropies 
or on angular scales larger than 1 arcminute (Hirata & Sel- 
jak 2003, Lewis & King 2006, Hu, De Deo & Vale 2007). 
Previous works have tried to estimate the lensing effect on 
smaller scales over secondary anisotropies such as the OV ef- 
fect (Zaldarriaga &: Seljak 1999). These works propose meth- 
ods to reconstruct the masses of galaxy clusters from their 
lensing signature. Due to its similarity, the reader may find 
interesting the works by Hu (2001) and Maturi et al. (2005a, 
2005b). In those papers the authors make use of filters for 
highlighting the lensing signals. In particular, Maturi et al. 
(2005a, 2005b) defines a filter which is formaly similar to 
the one used in this paper although they apply it to lensed 
galaxies. 

This paper will focus on the possibility of measuring 
the lensing effect due to high redshift clusters through ob- 
servations of the OV effect and its lensing distortions. We 
propose a new filter in sections 2 and 3, test it with sim- 



ulated data in section 4 and show the significance of the 
results for various sensitivities, resolutions and point source 
confusion levels in section 5. The same results could be also 
applied to potential observations of the reionization epoch 
by making high resolution maps of the 21-cm line at those 
high redshifts. We will present a fast and efficient algorithm 
for highlighting lensing signals in noisy data which could be 
used for estimating the mass of high redshift clusters. 



2 LENSING EFFECT 

The gravitational lensing effect can be quantified by the de- 
flection angle, a, created by the distorting mass (or lens) 
over the photons. A source which originally was located on 
a position j3 in the sky will appear on a different position 9 
at a distance a from the original position. 



/3 = 9-a(9,m(9)) 



(1) 



where a(9) is the deflection angle created by the lens which 
depends on the observed positions, 6. 

The deflection angle a at a given position is found by in- 
tegrating the (perpendicular) change in the 3D gravitational 
potential, $, along the line of sight; 
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Usually, the projected potential, ip, is used instead of $ 
2 Dis 



&dz 
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As a function of tp, the deflection angle is just, 



(4) 



In the above equations D ts , D t , and D s are the angular di- 
ameter distances from the lens to the source galaxy, the dis- 
tance from the observer to the lens and the distance from the 
observer to the source galaxy respectively. From the deflec- 
tion angle one can easily derive the magnification produced 
by the lens at a given position: 

_ ^ 9a x da y da x da y da x da y 
^ dx dy dx dy dy dy 

Given the gravitational potential if), the shear is defined in 
terms of the second partial derivatives of the potential %p 
(the Hessian of ip): 
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|(^ii - V22) = 7(0) cos[2<p], 
^12 = V"2i = 1(6) sin[2y>], 
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(7) 
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where 7(8) is the amplitude of the shear and tp its orienta- 
tion. The shear can be also expressed in terms of the deflec- 
tion angle, a, yielding 
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The amplitude and orientation of the shear are given by 



(10) 
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Figure 1. Shear distortion maps 71 (left) and 72 (right) for a 
cluster at the center (the cluster has arbitrary mass and rcdshift) . 
The images have been smoothed for representation purposes. 



Figure 2. Reduced shear maps g\ (left) and 32 (right) for the 
same cluster of figure 1. The images have been smoothed for rep- 
resentation purposes. 



7=V^F+7|, (11) 

V?=iatan( — ). (12) 
2 71 

In figure 1 we show the distribution of 71 and 72 for a clus- 
ter with a Navarro, Frenk and White (1997) profile (NFW 
profile hereafter) . When measuring shear distortions, the re- 
duced shear g is measured instead; 

- 7 



with the convergence k defined by; 



k = — 
2 
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Since 7 has two components, the reduced shear will have 
two components as well, gi and 32. In figure 2 we show 
the expected reduced shear signal (gi and (72) signal for a 
cluster (of arbitrary mass and redshift). The g± and g 2 maps 
show a distinctive pattern which divides the space in two 
regions. The division between the central region and the rest 
occurs when the convergence k = 1. This coincides with the 
tangential critical curve (Einstein radius) or region where 
the magnification is maximum. When crossing this line in a 
radial direction the nature of the lensing arcs change from 
tangential to radial (or viceversa). This can be appreciated 
by a change in the sign of both g\ and 32- The ring shaped 
distribution in g\ and <?2 will lose its symmetry if the lens 
is not symmetric. One expects the division between the two 
zones to follow the critical curve, defined by the region where 
the projected surface mass density equals the critical density. 



2.1 Quantifying shear distortions in CMB maps 

. Weak lensing analysis historically has been applied to op- 
tical images where a large number of background sources 
(galaxies) is observed and their ellipticities measured. By 
averaging the ellipticities of many galaxies (« 100) over a 
given area (~ 1 arcminute 2 ), an estimate of the reduced 
shear, g, is obtained in that area. Repeating the same pro- 
cess over different contiguous areas it is possible to give an 
estimate of the gravitational potential responsible for the 
measured average ellipticities. CMB maps are different be- 
cause one lacks individual images (galaxies) and therefore 
is not possible to define constrained images with well de- 
fined borders for which an ellipticity (or orientation) can 
be derived. Instead, one deals with continuous fields (the 



OV effect or field /) having a typical anisotropy scale of 
10-60 arcseconds. For this part of the discussion we will as- 
sume the field F consists only of pure OV effect signal (with 
lensing distortions). In a real experiment one will have also 
systematic noise and small scale contaminants such as point 
sources. We will discuss these later. The anisotropies of the 
OV effect have intrinsic ellipticities which introduce ran- 
dom correlated shears on these scales of 10-60 arcseconds. 
One expects the ellipticities of contiguous anisotropies (by 
contiguous we mean separated more ~ 1 arcmin) not to 
be correlated so the mean orientation of a sufficiently large 
area should average to (as it happens with the standard 
weak lensing of background galaxies). Given the scale of the 
OV effect this means we should average areas of many ar- 
cminutes in order to reduce the intrinsic shear scatter of the 
OV effect. This poses a huge problem for lensing analysis 
since the typical scale of lensing signals is a few arcmin for 
distant moderate mass clusters. Averaging the shear over 
many arcminutes would wash out any lensing signal from 
these clusters. We have to find a way to solve this problem. 
One option is to reduce the natural s cale of the OV effect 
(or field /). This can be done by working with the curvature 
map, Cf, of the field / instead. The curvature will reduce 
the large scale component of the OV effect and will enhance 
small scale fluctuations. Lensing signals will be minimally 
affected by this since the lensing stretches the anisotropies, 
therefore adding small scale features which will be enhanced 
when we compute the curvature of the map. The curvature 
of / can be defined as; 



dx 2 dy 2 



(15) 



We can anticipate here that when dealing with noise data, 
the curvature would be dominated by the pixel noise. This 
problem can be significantly reduced by filtering out the 
noise (for instance with a Gaussian filter) before calculating 
the curvature. We will discuss this point in more detail later. 

The orientations (or shear) of the curvature field, Cf, 
can be constructed from the first derivatives of the curvature 
field. Based on these derivatives it is possible to construct 
quantities similar to the Stokes parameters Q and U. 
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An associated intensity I c can be defined as; 



(16) 
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\/Qc + ul 



(18) 



For the shake of simplicity, from now on we will drop the 
suffix c from Q, U and I. We should notice the correspon- 
dence between the definition of the quantities (Q, U) and 
(71, 72)- Like in the case of 71, Q is positive when the shear 
is aligned vertically while Q is negative when the shear is 
aligned horizontally. In the case of U the situation is simi- 
lar but with a 45 degrees rotation (like in the case of 72). 
The U parameter is maximum when the features are aligned 
over the 45 degrees direction and minimum when they are 
aligned over the -45 degree direction. 

For speed purposes it is convenient to compute the cur- 
vature map Cf in Fourier space. 



Cf 



-k 2 f 



(19) 



where tilde denotes Fourier transforms, k — (k x ,k y ) is the 
Fourier k-mode (or wave vector) and / is the original data. 

Also, the first derivatives of the curvature map can be 
computed in Fourier space. 



C' x — ik x Cf 



C'y 



ikyCf 



(20) 
(21) 



The Stokes parameter fields Q and U are obtained after 
Fourier transforming back to real space the complex maps 
C' x and C'y and substituting them in equations (16) and 
(17). Computing the derivatives this way has the advantage 
of being faster but it has the inconvenience of being affected 
by border effects (due to periodical boundary conditions in 
the Fourier transform). We can remove border effects by 
neglecting the areas near the image borders. 



3 OPTIMAL FILTERS FOR LENSING IN CMB 
DATA 

The chances of detecting a weak signal embedded in a noisy 
background (in this case, the shear distortion created by a 
cluster in the CMB) can be greatly enhanced by using specif- 
ically tailored linear filters. The linear filters that maximize 
the signal to noise ratio of a given signal are known as lin- 
ear matched filters (often referred to as optimal filters). A 
matched filter is obtained by correlating a known signal, or 
template, with the given data in order to detect the pres- 
ence of the template in the data. In presence of pure white 
noise, the shape of the filter is the same as the shape of the 
signal to be detected. However, when the noise has spatial 
correlations the noise covariance matrix (or, equivalently, its 
power spectrum) must be taken into account. The general 
form of the matched filter is proportional in Fourier space 
to the template of the signal over the power spectrum of the 
background 

Matched filters have been widely used in engineering 
and data analysis for decades (Turin et al. 1960) and in 
many areas of Astronomy, including detection of galaxy clus- 
ters in X-ray surveys (Vikhlinin et al. 1995, Stewart et al. 
2006), in three-dimensional redshift surveys (Kepner et al. 
1999), detection of extragalactic point sources in CMB maps 
(Lopez-Caniego et al. 2006) and detecion of themal Sunyaev- 
Zel'dovich effect (Herranz et al. 2005, Melin et al. 2006). 
Besides, matched filters have been used for the detection 



of galaxy clusters through weak lensing in (Maturi et al. 
2005a, 2005b). Bernstein & Jarvis (2002) presented optimal 
techniques to measure the weak gravitational shear from im- 
ages of galaxies. In these works a matched or optimal filters 
for the shear were constructed. In this work we will focus in- 
stead in designing matched filters for the Stokes parameters 
Q and U of the curvature field. 

After decomposing the curvature map into its Q and 
U components (using the Fourier technique) we can apply 
an appropriate filter to highlight the expected lensing signal 
from a galaxy cluster. Optimal filters are generally defined 
(in Fourier space) as the signal (theoretical) we want to de- 
tect divided by the power spectrum of the background. The 
theoretical lensing signal can be computed numerically for 
a given profile calculating the deflection angles a x and a y 
and then the reduced shear g — (51,32). The power spec- 
trum of the background can be obtained directly from the 
data (in our case the power spectrum of the Q and U maps). 
The negative is that we need to assume a cluster profile in 
order to compute g. This problem can be partially solved if 
we adopt a modified version of the definition of g. We will 
assume the reduced shear can be well described by; 



gi = 71 x (l + K ) 
32 = 72 x (1 + k) 



(22) 
(23) 



We use the hat notation to distinguish between the formal 
definition of gi and 32 (see equation 13). In the weak lensing 
limit where k << 1 equations (22) and (23) are similar to 
equation (13) (to first order in k). The above equations will 
fail in describing the correct shear where k ~ 1 or k > 1. 
This range corresponds to the strong lensing case. Our new 
definition for gi and 32 will have no negative consequences if 
the strong lensing part of the data is omitted. This is exactly 
what we are going to do. By omitting the strong lensing part 
of the data we avoid two important problems. On one hand, 
the Einstein ring shown in figure 2 disappears since with the 
new definition there is no change in sign when k « 1. This 
will make our results less insensitive (and therefore more 
robust) to any assumption about the specific profile of the 
cluster. On the other hand, by excluding the central region 
of the cluster we avoid some of the most serious contam- 
inants, in particular the kinetic Sunyaev-Zel'dovich effect 
due to internal bulk motions of the gas in the cluster and 
also mm or sub-mm extragalactic sources. The kinetic SZ 
can not be removed with multifrequency observations (like 
the thermal SZ) since it does not depend on the frequency. 
Point sources within the galaxy cluster will be unresolved at 
the resolutions we are considering here (fwhm > 5 arcsec) 
and they can be very close to each other making the use 
of matched filters even harder due to their blending. One 
way to avoid the central region of the cluster is by means of 
a damping function. This leads us to the final form of the 
theoretical signal that we will assume for constructing the 
optimal filter. 



gi = (l-G(a)) 2 X71 x + 
32 = (1 -G(a)f x 72 X (1 + k) 



(24) 
(25) 



where G(a) is a Gaussian function with dispersion a. The 
above definition introduces a scale a. The role of this scale 
is somehow similar to the Einstein radius but with the dif- 
ference that we only require that a is of the order of the 
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Figure 3. Optimal filters Fq (left) and Fu (right) obtained for 
one of our simulations. Note how the intensity goes to at the 
center as a consequence of the damping function. The oscillations 
are typical of matched filters. 
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Einstein radius (a factor 2 difference does not make a big 
difference as we will see below). Although we are not going 
to discuss about alternative choices for g\ and §2 we want to 
note that simplified versions of the reduced shear gave also 
good results when applied to our our simulated data. In par- 
ticular, due to its simplicity we can emphasize the following 
one; 



51 = (1 - G(a)) 2 x X 



g 2 = (l-G(a)) 2 x 



x 2 + y 2 
x 2 + y 2 



(26) 
(27) 



where x and y are the distance to the center of the cluster. 
This form reminds one of the kernel proposed by Squires & 
Kaiser 1996) although the use of that kernel and our filter 
is different in each case. 

In figure 3 we show the filter for a — 0.3 arcmin (field 
of view of 12 arcmin). If the contaminants can be accounted 
for and removed, then the strong lensing signal could be 
measured as well. In this case, filters like g\ and g% (see 
figure 2), may be more appropriate. 

Once the expected signal has been denned, the opti- 
mal filters (one for Q and one for U) are constructed in a 
standard way. 



Fq = 



Fu 



gi(fc) x ft(*0 

g 2 (k) x b(k) 
Pu(k) 



(28) 



(29) 



Where as above tilde denotes Fourier transform. The term 
b(k) is the Fourier transform of the kernel used to filter out 
the noise. We will discuss this factor in more detail later but 
we prefer to include it here for completion. We use a Gaus- 
sian kernel for b with a scale larger than the resolution of 
the experiment. Pc}{k) and Pu(k) are the power spectrum 
of the Q and U maps respectively (original data — > filter 
noise — > curvature — > 1st derivatives — > Q and U). 
A very similar philosophy has been followed in the past to 
define optimal filters for weak lensing optical surveys (Mar 
turi et al. 2005b). Although there are some differences, our 
filter and theirs are closely related. Some differences are the 
use of the damping function in our filter to reduce cluster 
contamination or the use of the total (background) power 
instead of the noise power spectrum in their definition of 
the filter. Alos, in their work the authors aopply their filter 



Figure 4. Fraction of ionized gas as a function of redshift. 

to simulated optical data (with background galaxies acting 
as the source plane for lensing) . 

4 SIMULATIONS 

Testing the algorithm with simulations is essential, not only 
to prove its feasibility but also to identify its failures and 
weaknesses. We first simulate the OV effect when CMB pho- 
tons cross the first newly re-ionized regions at high redshift. 
After the universe recombined at redshift z w 1100, it un- 
derwent a period know as the dark ages where the small den- 
sity perturbations (of the order of 10—5 at redshift z=1000) 
started to grow and form larger structures. During this pe- 
riod the Universe was neutral and transparent to CMB pho- 
tons. At some point around redshift z ~ 20 it is believed that 
the first stars and/or quasars re-ionized the medium around 
them (UV radiation). As the times passed this newly re- 
ionized zones started to grow and merge forming larger and 
larger ionized areas. Free electrons in ionized regions would 
re-scatter a fraction (~ 10%) of the CMB photons (inverse 
Compton scattering). Due to the low energy of CMB pho- 
tons the scattering is elastic and isotropic (the scattered 
photon can leave the interaction in any direction). However, 
the CMB photon suffers a Doppler shift in its spectrum due 
to the electrons motion. The distortion in the spectrum of 
the CMB photons is then proportional to the bulk velocity 
of the free electrons and their optical depth, r. 

%~±r (30, 

with v r the radial component of the electron bulk veloc- 
ity (or projection along the line of sight). As mentioned in 
the introduction, the OV effect is similar to the kinetic SZ. 
However, the later is commonly used when we refer to hot- 
ionized plasma in galaxy clusters while the OV effect focuses 
on ionized regions in much larger scales. In the introduction 
we mentioned also that the equivalent of the thermal OV ef- 
fect can be considered negligible since the temperature of the 
intergalactic medium (IGM) is much smaller than in galaxy 
clusters (T IG m ~ 10 4 K, Haehnelt & Steinmetz 1998, Hui 
& Haiman 2003). Although the Universe is also ionized at 
low redshift, this distortion is expected to be dominated by 
the high redshift Universe because it is denser by a factor 
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(1 + z) 3 . The interaction between the free electrons and the 
CMB photons will be then more likely at high redshifts. To 
simulate the OV effect we use the public code GADGET-2 
(Springer et al. 2005). We make 6 different simulations of 
50 Mpc (comovil) cubes each and with more than 2 mil- 
lion particles per cube. The simulation is started at redshift 
z « 40 until z ex 3. When projected, the cubes always have 
a scale larger than 20 arcmin for any redshift. We adopted 
a stacking procedure to project the simulations along the 
line of sight. We stack 45 projections of 12 x 12 arcmin 
each (50 Mpc deep). This give us a range in redshift be- 
tween Zmi n — 4.6 and z ma x = 20.7 (assuming Q m — 0.27 
and A = 0.3). This range of redshifts guarantees that we 
do not miss the beginning or end of the reionization epoch 
(5 < z < 20). Since we have only 6 independent simulations 
and in order to avoid correlated repetitions of the same pro- 
jected zone, we need to rotate and/or take different patches 
or quadrants (of 12 arcmin each). We do not impose con- 
tinuity conditions in the overlapping zone of the boxes but 
we expect this to be a minor systematic effect given the di- 
mensions of the cubes. We assumed a patchy reionization 
and with a reionization history starting at redshift z ~ 20 
and leading to a fully ionized Universe at redshift z m 10 
(see figure 4). This ionization history is compatible with the 
upper limit of the latest WMAP constraints (Page et al. 
2006). We assumed that high density regions were ionized 
first (Sokasian et al. 2003). Taking a homogeneous reion- 
ization instead reduces the OV power at small scales by a 
factor 2 or 3. The OV effect is computed by projecting along 
the line of sight the radial velocity times the optical depth 
(which depends on the reionization history). The final map 
shows features at small and large angular scales. The RMS 
peaks at scales between 20 arcseconds and 1 arcmin (see 
figure 5) with values of RMS w 5fjK. When compared with 
the CMB power spectrum, the OV spectrum dominates at 
scales smaller than 2 arcmin (assuming the constribution 
from instrumental noise, point sources and galaxy clusters 
can be removed). This results lay on the high end but are 
still comparable to previous estimations of this effect (Jaffe 
& Kamionkowski 1998). Our simulations are simplistic in the 
sense that we do not take into consideration all the physical 
processes involved in the ionization of the IGM. Instead we 
reduce all the physics to the ionized fraction. However, these 
simulations will suffice for the purpose of this paper. The fi- 
nal map of the OV effect is used as a background template 
for computing the lensing effect of clusters at redshift 2 = 1. 
We use the public package WSLAP for the computation of 
the lensing effect (Diego et al. 2004 etc) The resulting map 
is convolved with a Gaussian kernel to simulate the exper- 
imental beam and then is pixelized. We take 3 pixels per 
resolution element (fwhm). White Gaussian noise is added 
afterwards to account for instrumental noise. We will con- 
sider different noise levels and resolutions (4.2, 8.5 and 17 
arcsec fwhm). More details on the resolution and noise levels 
will be given later. As discussed above, in order to reduce 
the intrinsic shear of the OV effect we work with the curva- 
ture map instead. However, before computing the curvature 
the noise has to be filtered out since otherwise pixel to pixel 
variations due to the instrumental noise would dominate the 
curvature. The curvature map is then used to derive the Q 
and U maps of the data. These two maps are then optimally 
filtered and added to render the final result. In figure 6 we 
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Figure 5. Power spectrum of the simulated OV effect (dashed 
line) compared with the power spectrum of the CMB (solid line). 

show an example for a cluster at redshift z = 1 and mass 
M — 4 x lO 14 ft~ 1 M . The noise level was 3/iK per pixel 
and the beam of the experiment had a fwhm — 8.43 arcsec. 
The combination of Q and U filters produce a clear detec- 
tion with a significance of 5<r. The RMS a is computed in 
the area outside the white circle in the right panel. The bor- 
ders have been ignored because of systematic effects due to 
computations in Fourier space. It is interesting to note how 
the filter does not highlight features that at first can look 
like lensing effects. For instance, there is a round structure, 
similar to the lensing signal, on the top left side of the im- 
age. This structure is not enhanced at the same level as the 
real lensing signal. 



5 RESULTS 

The simulated data has to be pre-processed as we mention 
earlier. The Q and U maps are computed over the curvature 
map of the data. However, the local curvature would be 
completely dominated by the instrumental noise if we do 
not attempt to filter it out first. For this purpose we apply 
a Gaussian filter with a dispersion a g which depends on 
the level of noise. For white Gaussian noise the RMS of the 
filtered noise decreases as l/a g . Depending on the noise level 
we can choose the appropriate a g so the noise level of the 
filtered map does not dominate the curvature of OV effect. 

The results will depend on the characteristics of the 
experiment. In particular its sensitivity and resolution but 
it will also depend crucially on other contaminants like the 
Sunyaev-Zel'dovich effect in galaxy clusters (thermal and ki- 
netic) and more importantly mm and sub-mm sources. The 
thermal Sunyaev-Zel'dovich effect can be removed if several 
frequencies are available since this effect follows a well es- 
tablished frequency dependence. The kinetic SZ on the other 
hand is frequency independent but it is proportional to the 
thermal SZ (at least the component due to cluster bulk mo- 
tion) so it can be also partially removed. However, the re- 
moval might not be perfect and some residuals may be left 
specially in the central part of the cluster where energetic 
phenomena can create wave fronts or fast internal motions 
of the gas difficult to extract (Mathis et al. 2005) . However, 
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Figure 6. The panels show the original simulated OV effect in a 12 X 12 arcmin 2 field (top-left). There is a cluster at redshift z = 1 and 
with mass M = 4x 10 14 h -1 Mq near the center. The cluster potential distorts the space around it creating distinctive lensing patterns. 
The middle panel shows the same map after being observed with an experiment having a resolution of 8.43 arcsec fwhm and a noise level 
of per pixel (or cquivalcntly 0.15/iK per arcmin 2 .) The right panel shows the final result obtained after combining the optimally 
filtered Q and U maps. This cluster is at the limit of dctectability (SNR f» 5) 



as mentioned earlier, our strategy to deal with these con- 
taminants is neglecting the central regions of galaxy clusters 
by using a damping function in the center of the filter. Our 
most worrying source of contamination will be point sources 
but before studying their effect we will focus on the other 
aspects affecting the results. 

5.1 Effects of the scale a 

First we study the effect of the damping function discussed 
above. The role of this scale can be identified as the mass of 
the cluster. Increasing the scale a changes the optimal filters 
in a way that makes bigger scales more relevant. It also has 
an effect on the peaks of the signal. A large a removes more 
contributions coming from the central part of a cluster but 
it also removes contributions coming from small scale fluctu- 
ations in the curvature map. Hence, if there are still signifi- 
cant small scale contributions to the curvature coming from 
instance from instrumental noise or point sources, the damp- 
ing function helps to remove these signals. It is important to 
note that the effect of the smoothing function (used to filter 
out the noise before computing the curvature) is similar to 
the damping function. When a smoothing kernel is applied 
over the theoretical shear, gi and <?2, it softens the peak 
compensating the positive and negative contributions at the 
center. One could then be tempted to remove the damping 
function from the definition of the optimal filter but this 
would have a negative effect on high signal-to-noise experi- 
ments where the noise does not suppose a serious threat. In 
this case, filtering out of the noise is not necessary but still 
a damping of the filter in its center is recommended to re- 
move contaminants and reduce contributions from isolated 
peaks in the curvature map. In order to evaluate the effect 
of a we will consider experiments with low noise (otherwise 
the effects of noise filtering can dominate the effect of the 
damping function). We find that a change in a of almost 
an order of magnitude renders basically the same number of 
detections (SNR > 5). Values of a much smaller than the ex- 
pected Einstein radius of a cluster produce results which are 



very sensitive to isolated peaks in the curvature map. On the 
other hand, values of a much bigger than a typical Einstein 
radius produce maps which are too diluted. Values for a in 
the range 20 — 60 arcseconds produce in general satisfactory 
results. The small end will be more suitable for distant lower 
mass clusters while the higher end is more appropriate for 
detecting nearer massive clusters. 

5.2 Effects of instrumental noise 

We assume that instrumental noise can be well described 
by a Gaussian distribution with dispersion on- We also as- 
sume that the noise is uncorrelated (white noise). In order 
to compare sensitivities the noise is normally expressed as 
fiK per arcmin 2 . We will follow the same notation and give 
our noise levels in the same units although in some cases it 
will be useful to express the noise in fiK per pixel. At the 
scales of the resolution elements of our simulated data (4.2, 
8.5 and 17 arcsec) the signal we are simulating (the OV ef- 
fect) has an RMS of a few /jK. The noise level should be 
of the same order or below the signal level itself (otherwise 
the experiment would be noise dominated and we would not 
see any signal at all). This means that the noise per pixel 
should not be more than a few /jK. This noise level per pixel 
translated into pK per arcmin 2 gives w 0.1/iK per arcmin 2 . 
This sensitivity is, for today's standards, about only one or- 
der of magnitude better than current experiments. In order 
to make a statistical analysis we make 100 simulations for 
each case varying the noise (but keeping crjv) and position 
of the cluster in each one. After taking different noise lev- 
els and applying the formalism discussed above we compute 
the SNR (for the 100 cases) as the ratio between the filtered 
signal at the central position of the cluster and the RMS 
computed in the area excluding the cluster. The result as 
a function of noise level can be seen in figure 7. We plot 
SNR and its dispersion (from the 100 simulations) for two 
cluster masses at redshift z = 1. In this result, the simu- 
lated experiment had a resolution of fwhm = 8.5 arcsec. 
From the plot we see that for noise levels under 0.1/iK per 
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Figure 7. SNR as a function of noise level for two cluster mass. 
The clusters are at redshift z = 1 and masses are given in units 
of /i _1 Mq. Error bars correspond to la. 



arcmin it is possible to detect a significant fraction of clus- 
ters down to masses of M — 2 x 10 14 /i -1 M©. At noise levels 
of 0.5/iK per arcmin 2 approximately half the clusters with 
masses M = 6 x 10 14 h~ 1 MQ can be detected at these resolu- 
tions (in the absence of other contaminants) . This mass is ex- 
pected to be the largest cluster mass at z = 1. On the other 
hand, most of the low mass clusters (M < 2 x 10 14 /i _1 Mq) 
will be undetected with this noise level and resolution. A 
similar result is shown in figure 8 but this time fixing the 
noise level and varying the mass. Two noise levels are shown 
in the plot. The clusters are again assumed to be at z = 1. 
Both plots show larger variations in the SNR with increasing 
mass (even for smaller noise levels). The error bars depend 
on three factors. First the noise level, second the background 
OV and third the fact that we use an optimal filter which 
depends on the background. The smaller contributions to 
the variation in the SNR are the noise level and the change 
in background which enters the definition of the optimal fil- 
ter. Most of the variation in the SNR comes from the OV 
background behind the cluster. A background rich in fea- 
tures and big changes in intensity will produce sharper arcs 
which will be more easily identified in the curvature map. 
On the other hand, if the OV background behind the cluster 
is soft and poor in features, the lensing effect will show wide 
gradients in temperature rather than well defined arcs. This 
second case will be much more difficult to detect through 
the shear distortions. Bigger clusters have a larger lensing 
cross section. Hence they are sensitive to a wider range of 
scales and consequently their SNR can change more. 



5.3 Effects of spatial resolution 

Spatial resolution plays an important role on the detectabil- 
ity of this effect. As discussed in the last paragraph in the 
last subsection the detectability of a cluster depends very 
much on the OV background behind the cluster. If the back- 
ground is rich in structure the cluster will be detected more 
easily while if the background has small variations in inten- 
sity it will be more difficult to measure shear distortions. 
The beam of an experiment has two negative effects in lens- 
ing. On one hand it rounds the anisotropies (if the beam 
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Figure 8. SNR as a function of cluster mass for two levels of 
instrumental noise. The upper curve is for a noise level of 0.05/iK 
per arcminute 2 and the lower curve is for a noise level of 0.3/iK 
per arcminute 2 . The cluster is assumed to be at redshift 2 = 1 
and the experiment has a resolution of 8.5 arcsec per beam. 



is symmetric) reducing the shear of the anisotropy. On the 
other hand, it softens the anisotropies, removing variations 
in intensity smaller than the beam resolution. The ideal ex- 
periment will have a beam size smaller than the range where 
the RMS of the OV anisotropies is maximum (20 — 60 arc- 
sec). In this case we will consider three beam sizes; 4.2, 8.5 
and 17 arcsec. Larger beams will dilute the OV anisotropies 
hence making harder to detection of lensing signals. Also, 
larger beams makes the issue of point source confusion a 
more serious one. On the other hand a small beam is tech- 
nologically more challenging since a larger collecting area is 
required and the sensitivity has to be increased accordingly 
with the resolution. In figure 9 we show the effect of chang- 
ing the spatial resolution of the experiment while keeping 
its sensitivity (note that the pixel per noise must be differ- 
ent in each case to maintain the sensitivity) . Again we show 
the results for two different masses. Our results show that, 
in the range of resolutions relevant for the OV effect, the 
beam resolution of the experiment does not seem to have 
a large impact on the final results (although larger beams 
would start to reduce the SNR drastically) . The explanation 
for why the resolution does not seem to affect the result is 
because we are setting the noise level per arcmin 2 the same 
for the three cases (resolutions). The same noise level per 
arcmin 2 implies however a higher noise level per pixel in the 
higher resolution experiment. This means that filtering out 
the noise has to be done with a Gaussian which in absolute 
terms is equivalent in the three cases. We could say that at 
the end the effective resolution is determined more by the 
noise level than by the experimental beam. Since in figure 9 
the three experiments have the same noise level, the result 
is basically the same in all three cases. 

5.4 Effects of point sources 

Finally we explore the most serious source of systematic er- 
ror; galactic sources. To asses their importance we simulate 
a population of point sources assuming they are observed 
at 1.2 mm (250 GHz). Due to the lack of observations at 
these wavelengths, little is known about the number counts 
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Figure 9. SNR as a function of the experiment resolution. The 
noise level per arcminute is the same for the three simulated data 
sets. The upper curve shows the SNR for a clusters with M = 
bW^h^MQ while the bottom curve is for clusters with M = 
2l0 14 h' 1 M Q . Also shown arc the 1 — a error bars. 
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Figure 10. Integrated number counts for the two models used 
to simulate the point sources. The lack of data below S ~ 1 
mjy at these frequencies makes necessary to consider extrapola- 
tions. Data points are observations made with MAMBO at 1.2 
mm (Greve et al. 2004). 



in this range. Sub- mm and mm observations are still in their 
earlier stages. One of today's most precise measurements of 
the number counts of galaxies at 1.2 mm wavelengths comes 
from the MAMBO experiment (Smail, Ivison & Blain 1997, 
Hugues et al. 1998, Barger, Cowie & Sanders 1999, Bertoldi 
et al. 2000, Eales et al. 2000). A deep survey with MAMBO 
(Greve et al. 2004) revealed that number counts at 1.2 mm 
where about one order of magnitude less than at 850 fxm 
(SCUBA number counts). This drop in the number counts 
is partially due to the fact that mm sources are generally 
dimmer when wavelength increases. In particular, combin- 
ing MAMBO and SCUBA sources one finds that the ratio of 
fluxes S , 850M"i/ , S'i200/jm is about a factor three (Greve et al. 
2004). At even longer wavelengths one expects that num- 
ber counts are even smaller. This suggests that observing 
at frequencies around 100 GHz might be optimal to reduce 
the negative effects of point sources. To simulate the point 
sources at 1.2 mm we use the MAMBO number counts of 
Greve et al. (2004) . In the same paper the authors compare 
their number counts with models having different galaxy 
evolution. These models are compatible with both SCUBA 
and MAMBO number counts. Due to lack of information be- 
low 1 mjy (at 250 GHz or equivalently 1.2 mm), we extrap- 
olate the model to lower fluxes to account for the dimmer 
sources. We consider two different extrapolations that are 
compatible with the observed number counts. In figure 10 we 
show the integrated number counts of our two models. The 
flattening of the integrated number counts at low fluxes in 
figure 10 is predicted by simulations (Fardal et al. 2001). In 
our calculations, we assume that we can remove all sources 
above a given flux. Sources need to be subtracted before 
filtering out the noise and computing the curvature of the 
map. In this paper we assume the sources can be removed 
perfectly. This is an optimistic assumption as subtraction 
algorithms always leave a small residual. It is beyond the 
scope of this paper to study the effects of these residuals 
here and we will focus only on the negative effects of the 
sources which can not be removed. For the shake of simplic- 
ity, our sources are simulated assuming that they are not af- 
fected by lensing distortions. This is approximation is valid 



if the angular resolution of the experiment is substantially 
larger than the typical size of galaxies. With the resolutions 
considered in this work (fwhm ?a 8 arcsec) all our sources 
will be unresolved. We should notice that by doing this ap- 
proximation we are being conservative since sources might 
actually help to highlight lenses since they could introduce 
shears in the maps. However, we will not consider this case 
in this work. In order to check the effect of point sources 
we will add a population of sources up to a given flux cut. 
All sources above this flux cut are assumed to be removed 
from the map. Figure 11, summarizes our findings for three 
different cuts in flux. Our result predicts that sources are a 
major contaminant and that they need to be removed down 
to the microJy level (at 1.2 mm). This will be a serious issue 
which needs to be investigated further but not in this paper. 
For instance, multifrequency observations might be required 
to reduce the level of contamination of mm sources. 



6 DISCUSSION OF THE RESULTS 

Our simulations show how an experiment with a resolu- 
tion of fwhm w 8 arcsec and sensitivity of ajv ~ 0.2/iK 
arcmin~ 2 could detect high-redshift galaxy clusters through 
their lensing signature on the OV anisotropies. Experiments 
like ALMA will be soon near this limit. Current experiments 
like the Atacama 25-m telescope (ACT) can reach sensitiv- 
ities of « 2/iK with beams of 1.7 arcminute or less at mm 
frequencies (Kosowsky 2003). The main limiting factor to 
detect distant clusters through OV-lensing will be our abil- 
ity to subtract point sources. Our simulations show how one 
needs to subtract sources down to a few micro Jy (at A = 1.2 
mm) in order to observe the OV anisotropies and their lens- 
ing distortions. In the flux range 1-3 microJy (or 2-6 /iK per 
beam) one expects nearly one source per beam {fwhm w 8 
arcsec). Although the microJy level may seem unreachable, 
we have to notice that sources above 3 microJy flux can 
be seen as « 5a fluctuations with the sensitivities consid- 
ered in this paper (0.2/ik arcmin~ 2 ). Therefore it should be 
straightforward to subtract them. Also, even for lower flux 
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Figure 11. Signal-to-noise ratio as a function of the exclusion 
limit for point sources. The two curves correspond to the two 
models A and B of figure 10 



sources, the fact that their flux vary with frequency (while 
the OV effect does not) can be used to develop subtraction 
algorithms for multifrequency maps (Herranz et al. 2002). 
These algorithms can also use information about the shape 
of sources (which at these resolutions will be unresolved) 
to define a matched filter. Another limiting factor will be 
the sensitivity of the instrument. We estimate that sensitiv- 
ities below 0.2/xk arcmin~ 2 are necessary in order to reli- 
ably detect lensing signals. This level of sensitivity is about 
one order of magnitude better than current experiments. 
ALMA for instance expects sensitivities (in the continuum) 
of about 40 microjy with only 60 second integration in the 
frequency band 125-163 GHz (Wootten 2002). Longer inte- 
gration times would increase the sensitivity thus allowing to 
observe the OV anisotropics (6 ks would increase the sen- 
sitivity one order of magnitude to 4 microJy). Regarding 
the spatial resolution of the experiment we found that this 
is not as important as long as it is better than ~ 10 arc- 
sec. Smaller beams though, will allow a better subtraction 
of point sources. These kind of resolutions can be obtained 
with large single dish telescopes (LMT) or interferometers 
(ALMA). Achieving these sensitivities and resolutions will 
open a new window for cluster studies, allowing the detec- 
tion (and mass estimation) of high redshift clusters even 
before they can be observed by other means. This is partic- 
ularly interesting since at high redshift the geometric factor 
in the lensing signal decreases more slowly than at lower 
redshifts. For example, increasing the redshift of the cluster 
from 2 = 1 to z = 4 decreases the amplitude of the lensing 
signal only by a factor 2. 



where k is the wavevector in Fourier space, e(k) is the 
Fourier transform of the ellipticity and x(k) is the opera- 
tor. 



Xi(*) = 



Xa(*) 



~~ ^2 

2fcife 2 

W 



(32) 



(33) 



Taking Q and U as our estimates of the ellipticities and 
applying equation 31 renders results very similar to the ones 
presented in this work after the Q and U maps are averaged 
over scales of 20-30 arcsec. We should notice that while our 
filter acts as a convolution over the Q and U maps, the 
Kaiser and Squires operator acts as a combination of second 
derivatives of the Q and U maps. 



7 CONCLUSIONS 

The OV effect can create anisotropies of a few fiK on scales 
of 10 — 30 arcseconds. A galaxy cluster can distort an OV 
effect background creating a distinctive lensing signature. 
This distortion produces a shear of the OV anisotropies 
which can be highlighted with an appropriate filter. We have 
seen how an optimal filter can be built from the Stokes pa- 
rameters of the curvature map of the image. We test the 
potentiality of the filter to highlight lensing signals with 
simulated data. The simulated data includes different levels 
of instrumental noise, angular resolution, and point source 
contamination. We study how the instrument characteris- 
tics and point source contamination affect the result, find- 
ing that point sources might be the most serious problem 
and that it will be necessary to subtract them down to the 
microjy level at mm wavelengths. The effort to go down to 
this level should be rewarding. These type of studies would 
open a new and exciting window to explore cluster forma- 
tion, since it would allow to detect and estimate the mass 
of high redshift cluster through their shear distortions. A 
more careful analysis of the lensed images could predict the 
mass of the cluster with better accuracy. For instance, the 
Einstein radius could be easily identified in many cases since 
a continuous background guarantees that there will always 
be photons originating at the caustics. 
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6.1 Alternative choices for the filter 

It is interesting to notice that there is still room for im- 
provement. For instance, we have presented results for an 
optimal filter but other choices could render similar results. 
An interesting alternative is the operator defined by Kaiser 
and Squires (1993). In equation (2.1.12) of that paper they 
derive the Fourier transform of the mass surface density as; 

E(fc) = - Xi (fc)e;(fc) (31) 
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